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(57) Abstract 

Normally transcriptionally silent genes in a cell line or microorganism may be activated for expression by inserting a DNA 
regulatory element which is capable of promoting the expression of a normally expressed gene product in that cell or which is 
promiscuous, the regulatory element being inserted so as to be operatively linked with the normally silent gene in question. The 
insertion is accomplished by means of homologous recombination by creating a DNA construct including a segment having a 
DNA segment of the normally silent gene (targeting DNA) and the DNA regulatory element to induce gene transcription. ,The 
technique is also used to modify the expression characteristics of any endogenous gene of a given cell Une or microorganism. 



<WO 91099S5A1> 



• \ 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCX on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


es 


Spain 


MG 


Madagascar 


AU 


Austratia 


Fl 


Finland 


ML 


Mali 


BB 


Barbados 


FR 


France 


MN 


Mongolia 


BE 


Belgium 


GA 


Gabon 


MR 


Mauritania 


BP 


Burkina Faso 


CB 


United Kingdom 


MW 


Malawi 


BC 


Buigarta 


CN 


Guinea 


NL 


Netherlands 


BJ 


Benin 


GR 


Greece 


NO 


Norway 


BR 


Brazil 


HU 


Hungary 


PL 


Poland 


CA 


Canada 


tT 


haly 


RO 


Romania 


CF 


Central African Republic 


JP 


Japan 


SD 


Sudan 


CG 


Congo 


KP 


Democratic People's Republic 


SB 


Sweden 


CH 


Swiceerrand 




of Korea 


SN 


Senegal 


CI 


Cote dMvoiru 


KR 


Republic of Korea 


SU 


Soviet Union 


CM 


Cameroon 


LI 


Liechtenstein 


TD 


Chad 


CS 


Czechoslovakia 


LK 


Sri Lanka 


TC 


Togo 


DE 


Germany 


LU 


Luxembourg 


US 


United Stales of America 


DK 


Denmark 


MC 


Monaco 







I 



wo 91/09955 



-1- 



PCT/US90/07642 



Endogenous gene expression modification with regulatory 
element. 

5 PT-RT.n OF INVENTION 

The present invention relates to a process for 
the modification of the expression characteristics of a 
gene which is naturally present within the genome of a 
stable cell line or cloned microorganism. In the preferred 

10 embodiment, the present invention relates to a process for 
the activation and expression of a gene that is present 
within a stable cell line and normally transcriptionally 
silent or inert. As a result, the -protein product of that 
gene is expressed. This phenomenon occurs without 

15 transfecting the cell with the DNA that encodes the 

product. Rather, the resident gene coding for the desired 
product is identified within a cell and activated by 
inserting an appropriate regulatory segment through a 
technique called homologous recombination. Positive and/ or 

20 negative selectable markers can also be inserted to aid in 
selection of the cells in which proper homologous 
recombination events have occurred. As an additional 
embodiment, a specified gene can be amplified for enhanced 
expression rates, whether that gene" is normally 

25 transcriptionally silent and has been activated by means of 
the present invention, or endogenously expresses product. 

BACKGROUND OF THE INVENTION 
It is well known that each cell within an 
30 organism contains the genetic information that encodes all 
of the proteins found within that organism. However, only 
a very small percentage of the genes present within a given 
cell type is actually transcribed. The intracellular 
mechanisms that regulate the array of genes to be 
35 transcribed are now xinderstood. Cell specific proteins 
present within the nucleus interact with DNA regulatory 



wo 91/09955 



- 2 - 



PCr/LIS90/07642 



ssgments that are linked with particular genes. This 
interaction of nuclear proteins with DNA regulatory 
sequences is required for gene transcription. This results 
in itiRNA biosynthesis and ultimate expression of the encoded 
5 protein (Mitchell and Tjian, Science , 245:371,1989). 

These DNA regulatory segments or elements for 
each gene lie upstream from and, in some cases, within or 
even downstream of the coding regions. Through an 
interaction with cell specific nuclear proteins, DNA 

10 regulatory segments affect the ability of RNA polymerase, 
the rate limit ing^ enzyme in protein expression, to gain 
access to the body of the gene and synthesize a mRNA 
transcript. Thus, these DNA segments and the resident 
nuclear proteins play a critical role in the regulation of 

15 expression of specific genes (Johnson and McKnight, Ann. 
Rev. Biochem. . 58:799, 1989). 

The DNA regulatory segments are binding sites for 
the nuclear proteins. These nuclear proteins attach to the 
DNA helix and apparently alter its structure to make the 

20 desired gene available for RNA polymerase recognition, 

which facilitates gene transcription. The expression of 
these cell specific regulatory proteins determines which 
genes will be transcribed within a cell and the rate at 
which this expression will occur. As an example' of ' the 

25 specificity of this system, pituitary cells but not liver 

cells express pituitary proteins, even though the genes for 
the pituitary proteins are present within all liver cells. 
Nuclei of the liver cells do not contain the specific DNA 
binding proteins which interact with the elements of 

3 0 pituitary genes resident within the liver cells. 

Current Methods Employed to Express Proteins Using 
Recombinant DNA Technolocry 

With the knowledge that specific DNA regulatory 
35 sequences are required to activate gene transcription 

within a particular cell type, scientists have expressed 
foreign genes within a particular cell type through genetic 
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engineering. In general, DNA regulatory segments that are 
recognized by the cell's nuclear proteins are placed 
upstream from the coding region of a foreign gene to be 
expressed. In this way, after insertion into the cell, 
5 foreign DNA may be expressed since the cell's nuclear 
regulatory proteins now recognize these DNA regulatory 
sequences. This technology has been employed to produce 
proteins that have been difficult to obtain or purify from 
natural sources by traditional purification strategies. 

10 In addition to the recognizable DNA sequences and 

the gene of interest, a selectable marker is attached to 
the DNA construction. In this way, only the cells that 
have taken up the DNA survive following culture in a 
selectable medium. For example, the gene for neomycin 

15 resistance may be included in the expression vector. 

Following transf ection, cells are cultured in G418, a 
neomycin antibiotic that is lethal to mammalian cells. If 
however, the cells have acquired the neomycin resistance 
gene, they will be able to withstand the toxic effects of 

2 0 the drug. In this way, only the cells that have taken up 
the transf ected DNA are maintained in culture* It is 
understood that any selectable marker could be used as long 
as it provided for selection of cells that had taken up the 
transf ected DNA. It is further understood that there is no 

2 5 criticality as to the specific location of the inserted 

genetic material within the cell. It is only important that 
it be taken up somewhere within the nucleus as both the 
regulatory segment and the foreign gene (as well as the 
selectable marker) are inserted together. 

30 

Deficiencies in the Current Methods of Gene Expression 

While the above techniques have been instrumental 
in exploiting the power of genetic engineering, they have 
not always been the most efficient methods to express 

3 5 genes. This is due to the fact that insertion of DNA into 

the nucleus of a cell line is usually accomplished through 
a technique known as transf ection. DNA that has been 
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engineered for expression in the cell line of interest is 
precipitated and the cell membrane is solubilized to allow 
entry of the DNA. As indicated above, the exact site into 
which the DNA incorporates into the genome is never 
predictable; indeed the DNA may remain episomal (not 
integrated into the genome) . This results in the 
unpredictability of both the level of expression of the 
protein produced and the stability of the cell line. 

A second shortcoming of this technique is the 
fact that the construction of the expression vector is 
extremely difficult when the gene of interest is relatively 
large (greater than 5-10 kilobases) . Many of the proteins 
expressed by recombinant DNA technology have been encoded 
by cDNAs rather than much larger genomic clones- This is 
done to reduce the overall size of the insert. While the 
use of cDNAs makes genetic engineering more convenient, 
rates of gene transcription and protein production may 
suffer as a result. It has recently been shown that 
esqjression levels are sometimes greatly enhanced through 
the use of genomic rather than cDNA inserts (Brinster et 

Proc. Natl. Acad. Sci., 85:836-840, 1988, and Chung 

and Perry, Mol. Cell. Biol. . 9:2075- 2082, 1989). 
Although the mechanisms responsible for this observation 
are not well understood, it is known that in certain 
situations enhancer elements present within introns can 
improve the transcriptional efficiency of the gene. There 
is also evidence that introns, or the splicing events which 
result from the presence of introns, may have an effect on 
the RNA processing events which follow the initiation of 
transcription (Buchman and Berg, Mol. Cell. Biol. . 8:4395- 
4405, 1988) . This may stabilize the transcript thereby 
improving the rate of mRNA accumulation. In the above 
cited Brinster et al paper, it is also postulated that the 
position of the introns within the gene may be important 
for phasing of nucleosomes relative to the promoter. The 
influence of various regulatory elements on transcription 
of eukaryotic genes is discussed in Khoury et al, cell . 
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33:313-14 (1983), Maniatis et al. Science , 236:1237-45 

(1987) and Muller et al, Eur . J . Biochem. . 176:485-95 

(1988) , 

Thirdly, to gain entry into the nucleus, the 
5 transfected DNA, including the entire coding region of the 
foreign gene, must traverse the cytoplasm following entry 
through the permeabilized plasma membrane of the cell. 
During that time, the DNA may come in contact with 
lysosomal enzymes which may alter or completely destroy the 
10 integrity of the DNA, Thus, the coding region of the DNA 
may not be identical to that which was transfected. 

The novel method of gene activation and/or 
expression modification that we describe below cannot 
result in the production of mutant forms of the desired 
15 protein, since the coding region of the desired gene is not 
subjected to enzymatic modifications* 

In summary, a large amount of the DNA transfected 
into the cell using traditional techniques, and 
particularly the coding region thereof, will not be 
20 faithfully transcribed. It may be degraded prior to entry 
into the nucleus, enzymatically perturbed so that it will 
not encode the entire desired protein or it may not contain 
all of the necessary regulatory segments to allow for 
transcription. It may be inserted into a portion of the 
25 genome that prevents transcription. If the cDNA is 

transcribed, the protein of interest may not be produced 
efficiently due to the omission of introns which may 
contain enhancers or enable efficient mRNA processing. 
Finally, it may remain episomal, promote protein production 
3 0 but be unstable as the cell population grows through cell 
division. 

It would be most desirable to develop a method of 
induction of gene expression that would produce a cell line 
that has incorporated the positive attributes of the 
35 existing methods but somehow circumvents the unattractive 
features. It would further be desirable to be able to 
express or modify endogenous expression of particular genes 
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in the cell type of choice. It is further desired to be 
able to take advantage of the potential benefits that may- 
be afforded by a complete genomic sequence which may 
include cryptic transcriptional enhancers that may reside 

5 within introns, by appropriate placement of introns for 
proper nucleosome phasing or by more efficient mRNA 
processing events. These advantages are ordinarily not 
enjoyed in recombinant DNA expression methods due to the 
size of the gene. If one were able to express a gene that 

0 is already resident in the genome, i.e., an endogenous 

gene, cell line stability and expression rates would become 
more consistent and predictable. 



SUMMARY OF THE INVENTION 
Accordingly, it is an object of the present 
invention to eliminate the above-noted deficiencies in the 
prior art. 

It is another object of the present invention to 
provide a method of regulation and/or amplification of gene 
expression that incorporates the positive attributes of 
recombinant gene technology but circumvents the 
unattractive features . 

It is a further object of the present invention 
to provide a method for expressing specific 'genes present 
but normally transcriptionally silent in a cell line of 
choice. 

It is yet a further object of the present 
invention to provide a method for expressing proteins which 
takes full advantage of complete genomic sequences that are 
responsible for mRNA accumulation and/or transcription. 

It is still another object of the present 
invention to provide a method of modifying the expression 
characteristics of a gene of interest by inserting DNA 
regulatory segments and/ or amplifying segments into the 
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genome of a stable cell line or cloned microorganism 
upstream of, within, or otherwise proximal to the native 
gene of interest. 

It is still a further object of the present 
5 invention to provide a method for modifying the expression 
characteristics of a gene which is naturally present within 
the genome of a stable cell line or cloned microorganism 
and at the same time insert characteristics which will aid 
in the selection of cells which have been properly 
10 modified. 

It is yet another object of the present invention 
to provide a genome having therein, proximal to the coding 
region or exons of a gene of interest, a regulatory or 
amplifying segment which does not naturally appear 
15 thereat. 

It is another object of the present invention to 
provide DNA constructs which can be used for accomplishing 
the homologous recombination methods of the present 
invention. 

20 It is a further object of the present invention 

to provide cell lines and microorganisms which include the 
genomes in accordance with the present invention. 

These and other objects of the present invention 
are accomplished by means of the technique of homologous 

25 recombination, by which one of ordinary skill in this art 
can cause the expression and, preferably, amplification of 
resident, albeit transcriptionally silent genes. By this 
technique, one can also modify the expression 
characteristics of a gene which is naturally present, but 

3 0 not necessarily silent or inert, within the genome of a 
stable cell line, such as, for example, to make the 
expression conditional, i.e., repiressible or inducible, or 
to enhance the rate of expression. 

The present invention provides a method of 

35 modifying the expression characteristics of a gene within 
the genome of a cell line or microorganism. A DNA 
construct is inserted into that genome by the technique of 
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homologous recombination. The construct includes a DNA 
regulatory segment capable of modifying the expression 
characteristics of any gene to which it is operatively 
linked within the host cell line or microorganism, as well 
5 as a targeting segment homologous to a region of the genome 
at which it is desired for the DNA regulatory segment to be 
inserted. The construct and insertion technique is 
designed to cause the new DNA regulatory segment to be 
operatively linked to the gene of interest. Thus, without 

10 necessarily inserting any new coding exons, the expression 
characteristics of _ that gene are modified. In the 
preferred embodiment, the gene is one which is normally 
transcriptionally silent or inert within the host cell line 
or microorganism and, by means of the DNA regulatory 

15 region, which is targeted directly to the appropriate 

position with respect to that gene by means of homologous 
recombination, that gene is thereby activated for 
expression of its gene product. 

The DNA construct preferably includes two 

20 targeting segments which, while separated from one another 
in the construct by those elements to be inserted into the 
genome, are preferably contiguous in the native gene. 

The construct further preferably includes at 
least one expressible selectable marker gene, such as the 

25 gene providing neomycin resistance. This marker gene, 

including a promoter therefor, is also disposed between the 
two targeting regions of the construct. 

In another embodiment, the construct includes an 
expressible amplifiable gene in order to amplify expression 

3 0 of the gene of interest. This gene, including a promoter 
therefor, is also disposed between the two targeting 
regions of the construct. In some cases the selectable 
marker and the amplifiable marker may be the same. 

In a further embodiment of the present invention, 

35 the DNA construct includes a negative selectable marker 
gene which is not expressed in cells in which the DNA 
construct is properly inserted. This negative selectable 
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marker gene is disposed outside of the two targeting 
regions so as to be removed when the . construct is properly 
combined into the gene by homologous recombination. An 
example of such a negative selectable marker gene is the 
5 Herpes Simplex Virus thymidine kinase gene. 

In yet a further embodiment, it is possible to 
modify the expression characteristics of a specific gene 
which already expresses a product in the cell line or 
microorganism of interest. This can be accomplished by 

10 inserting by homologous recombination a DNA construct which 
includes (1) an expressible amplifiable gene which 
increases the copy number of the gene of interest when the 
cell line or microorganism is subjected to amplification 
conditions" and/ or (2) a promoter/ enhancer element (or other 

15 regulatory element) which modifies the expression of the 
gene of interest such as, for example, by increasing the 
rate of transcription, increasing translation efficiency, 
increasing mRNA accumulation, making the expression 
inducible, etc. The gene expression which is modified in 

2 0 this manner may be natural expression or expression which 
has been caused by previous genetic manipulation of the 
cell line or microorganism. The previous genetic 
manipulation may have been by conventional techniques or by 
means of homologous recombination in accordance with the 

25 present invention. In the latter case, the DNA insertion 
which results in the modification of expression 
characteristics may be accomplished as part of the same 
genetic manipulation which results in expression of the 
gene or may be performed as a subsequent step. 

30 The present invention also includes the 

constructs prepared in accordance with the above discussion 
as well as the genomes which have been properly subjected 
to homologous recombination by means of such constructs and 
the cell lines and microorganisms including these genomes. 



35 
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Moreover^ a process for preparation of the desired product 
by culturing the transformed cells according to the present 
invention is also included • 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a general outline of a DNA construct 
in accordance with the present invention. 

Fig. 2A shows the mode of integration of the DNA 
construct into the genome in the event of non-homologous or 
10 random recombination. 

Fig. 2B shows the mode of integration of the DNA 
construct in the genome in the event of homologous 
recombination . 

Fig. 3 shows the construction of a preferred 
15 homologous recombination vector in accordance with the 
present invention. 

Fig. 4 shows the mode of integration of a 
circular piece of DNA by homologous recombination when only 
a single targeting piece of DNA is employed. 
20 Fig. 5 shows the pRSVCAT plasmid, including the 

restriction sites thereof. 

Fig. 6 shows the construction of the pRSV 
plasmid, including the restriction sites thereof. 

Fig. 7 shows the pSV2NEO' plasmid, including the 
^5 restriction sites thereof. 

Fig. 8 shows the construction of the pSVNEOBAM 
plasmid, including the restriction sites thereof. 

Fig. 9 shows the construction of the pRSVNEO 
plasmid, including the restriction sites thereof. 
3 0 Fig. 10 shows the construction of the pRSVCATNEO 

plasmid, including the restriction sites thereof. 

Fig. 11 shows a 15.3 kb fragment of the rat TSHB 
gene and showing various restriction segments thereof. 
Fig. 12 shows the construction of the 
55 PRSVCATNEOTSHB3 plasmid, including the restriction sites 
thereof . 
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Fig. 13 shows the construction of the 
pRSVCATNEOTSHB3-5XbaI plasmid, including the restriction 
sites thereof. 

Fig. 14 shows a portion of the nucleotide 
5 sequence of TSHB along with the regions thereof to which 

each primer for PGR amplification corresponds. Exons 2 and 
3 are shown in capital letters. A 2 47 BP amplified 
fragment is shown by underlined asterisks. 

Fig. 15 shows the results of polyacrylamide gel 
10 electrophoresis of cDNA synthesized from RNA extracted from 
various cell populations and whose TSHB cDNA, if present, 
has been amplified by PCR. The nature of the cells 
representing the various lanes is set forth in Fig. 15 
below the gel. 

15 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Homologous recombination is a technique developed 
within the past few years for targeting genes to induce or 
correct mutations in transcriptionally active genes 

2 0 (Kucherlapati, Prog. in Nucl. Acid Res, and Mol. Biol. . 

36:301 (1989)). This technique of homologous recombination 
was developed as a method for introduction of specific 
mutations into specific regions of the mammalian genome 
(Thomas et al.. Cell . 44:419-428^ 1986; Thomas and" 
25 Capecchi, Cell , 51:503-512, 1987; Doetschman et al. , Proc> 
Natl. Acad. Sci> . 85:8583-8587, 1988) or to correct 
specific mutations within defective genes (Doetschman et 
al.. Nature . 330:576-578, 1987). 

Through this technique, a piece of DNA that one 

3 0 desires to be inserted into the genome can be directed to a 

specific region of the gene of interest by attaching it to 
"targeting DNA". "Targeting DNA" is DNA that is 
complementary (homologous) to a region of the genomic DNA. 
If two homologous pieces of single stranded DNA (i.e., the 
35 targeting DNA and the genomic DNA) are in close proximity, 
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they will hybridize to form a double stranded helix. 
Attached to the targeting DNA is the DNA sequence that one 
desires to insert into the genome . 

There are a number of methods by which homologous 
5 recombination can occur. One example is during the process 
of replication of DNA during mitosis in cells. 

Through a mechanism that is not completely 
understood, parental double-stranded DNA is opened 
immediately prior to cell division at a local region called 

10 the replication bubble. The two separated strands of DNA 
may now serve as templates from which new strands of DNA 
are synthesized. One arm of the replication fork has the 
DNA code in the 5' to 3' direction, which is the 
appropriate orientation from which the enzyme DNA 

15 polymerase can "read". This enzyme attaches to the 5» 

portion of the single stranded DNA and using the strand as 
a template, begins to synthesize the complementary DNA 
strand. The other parental strand of DNA is encoded in the 
3' to 5' direction. It cannot be read in this direction by 

20 DNA polymerase. For this strand of DNA to replicate, a 
special mechanism must occur. 

A specialized enzyme, RNA primase, attaches 
itself to the 3' to 5' strand of DNA and synthesizes a 
short RNA primer at intervals along the strand. Using 

25 these RNA segments as primers, the DNA polymerase now 

attaches to the primed DNA and synthesizes a complementary 
piece of DNA in the 5' to 3* direction. These pieces of 
newly synthesized DNA are called Qkazaki fragments . The 
RNA primers that were responsible for starting the entire 

30 reaction are removed by the exonuclease function of the DNA 
polymerase and replaced with DNA. This phenomenon 
continues until the polymerase reaches an unprimed stretch 
of DNA, where the local synthetic process stops. Thus, 
although the complementary parental strand is synthesized 

3 5 overall in the 3« to 5* direction, it is actually produced 
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by "bacJcstitching" in the 5 ' to 3 ' direction. Any nicks 
that might occur in the DNA diiring the '"backstitching" 
process are sealed by an enzyme called DNA ligase. 

To maintain an absolute fidelity of the DNA code, 
5 a proofreading function is present within the DNA 

polymerase. The DNA polymerase requires primed pieces of 
DNA upon which to synthesize a new strand of DNA. As 
mentioned above, this can be a single strand of DNA primed 
with RNA, or a complementary strand of DNA. When the DNA 

10 polymerase finds mismatched complementary pieces of DNA, it 
can act as an exonuclease and remove DNA bases in a 3 ' to 
5' direction until it reaches perfect matching again. 

With this background, it is now possible to 
understand the basis of the technique described herein. 

15 Small pieces of targeting DNA that are complementary to a 
specific region of the genome are put in contact with the 
parental strand during the DNA replication process. It is 
a general property of DNA that has been inserted into a 
cell to hybridize and therefore recombine with other pieces 

2 0 of DNA through shared homologous regions. If this 

complementary strand is attached to an oligonucleotide that 
contains a mutation or a different sequence of DNA, it too 
is incorporated into the newly synthesized strand as a 
result of the recombination. As a result of the proof - 
25 reading function, it is possible for the new sequence of 

DNA to serve as the template. Thus, the transfected DNA is 
incorporated into the genome. 

If the sequence of a particular gene is known, a 
piece of DNA that is complementary to a selected region of 

3 0 the gene can be synthesized or otherwise obtained, such as 

by appropriate restriction of the native DNA at specific 
recognition sites bounding the region of interest. This 
piece will act as a targeting device upon insertion into 
the cell and will hybridize to its homologous region within 
35 the genome. If this hybridization occurs during DNA 

replication, this piece of DNA, and any additional sequence 
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attached thereto, will act as an Okazaki fracrment and will 
be backstitched into the newly synthesized daughter strand 
of DNA. 

In the technique of the present invention, 
5 attached to these pieces of targeting DNA are regions of 
DNA that are known to interact with the nuclear regulatory 
proteins present within the cell and, optionally, 
amplifiable and selectable DNA markers. Thus, the 
expression of specific proteins may be achieved not by 

10 transfection of DNA that encodes the gene itself and marker 
DNA, as is most common, but rather by the use of targeting 
- DNA (regions of homology with the endogenous gene of 
interest) coupled with DNA regulatory segments that provide 
the gene with recognizable signals for transcription. With 

15 this technology, it is possible to express and to amplify 
any cognate gene present within a cell type without 
actually transfecting that gene. In addition, the 
expression of this gene is controlled by the entire genomic 
DNA rather than portions of the gene or the cDNA, thus 

20 improving the rate of transcription and efficiency of mRNA 
processing. Fxarthermor e , the expression characteristics of 
any cognate gene present within a cell type can be modified 
by appropriate insertion of DNA regulatory segments and 
without inserting" entire coding portions of the gene of 

25 interest. 

In accordance with these aspects of the instant 
invention there are provided new methods for expressing 
normally transcriptionally silent genes of interest, or for 
modifying the expression of endogenously expressing genes 

30 of interest, within a differentiated cell line. The 

cognate genomic sequences that are desired to be expressed, 
or to have their expression modified, will be provided with 
the necessary cell specific DNA sequences (regulatory 
and/ or amplification segments) to direct or modify 

35 expression of the gene within the cell. The resulting DNA 
will comprise the DNA sequence coding for the desired 
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protein directly linked in an operative way to heterologous 
(for the cognate DNA sequence) regulatory and/ or 
amplification segments, A positive selectable marker is 
optionally included within the construction to facilitate 
5 the screening of resultant cells. The use of the neomycin 
resistance gene is preferred, although any selectable 
marker may be employed. Negative selectable markers may, 
optionally, also be employed. For instance, the Herpes 
Simplex virus thymidine kinase (HSVtk) gene may be used as 
10 a marker to select against randomly integrated vector DNA. 
The fused DNAs, or existing expressing DNAs, can be 
amplified if the targeting DNA is linked to an amplifiable 
marker . 

Therefore, in accordance with the method of the 
15 present invention, any gene which is normally expressed 
when present in its specific eukaryotic cell line, 
particularly a differentiated cell line, can be forced to 
expression in a cell line not specific for it wherein the 
gene is in a silent format. This occurs without actually 
2 0 inserting the full DNA sequence for that gene. In 

addition, that gene, or a normally expressing gene, can be 
amplified for enhanced expression rates. Furthermore, the 
expression characteristics of genes not totally 
transcriptionally silent can be modified as can the 
25 expression characteristics of genes in microorganisms. 

In one embodiment of the present invention, 
eukaryotic cells that contain but do not normally 
transcribe a specific gene of interest are induced to do so 
by the technique described herein. The homologous 
30 recombination vector described below is inserted into a 
clonal cell line and, following chemical selection, is 
monitored for production of a specific gene product by any 
appropriate means, such as, for example, by detection of 
loRNA transcribed from the newly activated gene, 
35 immunological detection of the specific gene product, or 
functional assay for the specific gene product. 
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The general outline of the DNA construct that is 
used to transcriptionally activate endogenous genes by 
homologous recombination is depicted in Fig\ire l. 

In general, the DNA construct comprises at least 
two and up to six or more separate DNA segments. The 
segments consist of at least one, preferably two, DNA 
targeting segments (A and B) homologous to a region of the 
cell genome within or proximal to the gene desired to be 
expressed, a positive selection gene (C) , an amplif iable 
gene (D) , a negative selection gene (E) and a DNA 
regulatory segment (F) which is transcriptionally active in 
the cell to be transfected. In the most basic embodiment- 
of the present invention, only a single targeting segment 
(B) and the regulatory segment (F) must be present. All of 
the other regions are optional and produce preferred 
constructs . 

Regions A and B are DNA sequences which are 
homologous to regions of the endogenous gene of interest 
which is to be made transcriptionally active. The specific 
regions A and B of the endogenous gene are selected so as 
to be upstream and downstream, respectively, of the 
specific position at which it is desired for the regulatory 
segment to be inserted. Although these regions are 
separated in the construct they are preferably contiguous 
25 in the endogenous gene. There may be occasions where non- 
contiguous portions of the genome are utilized as targeting 
segments, for example, where it is desired to delete a 
portion of the genome, such as a negative regulatory 
element. 
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While two targeting regions, a and B, are 
preferred in order to increase the total regions of 
homology and thus increase recombination efficiency, the 
process of the present invention also comprehends the use 
of only a single targeting region. In its simplest form 
(when only the regulatory segment F and the selectable 
marker gene C and promoter C are to be inserted), a 
circular piece of DNA is employed which contains these 
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elements along with the targeting DNA (see Figure 4) . In 
this way, the homologous region (B) hybridizes with its 
genomic counterpart. Segments C*, C and F are inserted 
within the B portion of the cognate gene following the 
5 crossover event. 

When it is desired for the DNA regulatory 
sequence to be inserted upstream of the gene of interest, 
as, for example, when it is desired to activate and express 
a normally transcriptionally silent gene, the region of 
10 homology is preferably homologous to a non-coding portion 
of the genome upstream of the coding portions of the gene 
of interest* When two targeting regions are present, the 
downstream region (A) may include a portion of the coding 
region, although it is preferred that it, too, be totally 
15 upstream of the coding region. It is further preferred 
that the homologous regions be chosen such that the DNA 
regulatory sequence will be inserted downstream of the 
native promoter for the gene of interest, particularly if 
the native promoter is a negative promoter rather than a 
20 turned-off positive promoter. 

The size of the targeting regions, i.e., the 
regions of homology, is not critical, although the shorter 
the regions the less likely that they will find the 
appropriate regions of homology and recombine at the 
25 desired spot. Thus, the shorter the regions of homology, 
the less efficient is the homologous recombination, i.e., 
the smaller the percentage of successfully recombined 
clones. It has been suggested that the minimum requirement 
for sequence homology is 25 base pairs (Ayares et al, Pnas. 
30 USA . 83:5199-5203, 1986). Furthermore, if any of the other 
elements of the construct are also found in the genome of 
the host cell, there is a possibility of recombination at 
the wrong place. However, in view of the excellent 
positive and negative selectability of the present 
35 invention, it can be successfully practiced even if the 

efficiency is low. The optimum results are achieved when 
the total region of homology, including both targeting 
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regions, is large, for example one to three kilobases. As 
long as the regulatable segment F can bfe operatively linked 
to the gene of interest there is no limit to the size of 
the targeting region, and particularly the upstream 
targeting region B. 

It can easily be empirically determined whether 
or not the targeting regions are too large or the 
regulatable segment F spaced too far from the coding region 
of the gene to be operatively linked thereto. In such a 
case, the regions A and B can be made homologous to a 
different section of the gene of interest and the process 
repeated until the regulatable segment F is properly 
inserted so as to be operatively linked to the gene of 
xnterest. For example, the restriction site of combined 
region A-B of the endogenous gene can be changed and the 
process repeated. Once the concept of the present 
invention is known, along with the techniques disclosed 
herein, one of ordinary skill in this art would be able to 
make and use the present invention with respect to any 
gxven gene of interest in any cell line or microorganism 
without use of undue experimentation. 

Region C is a positive selectable marker gene 
Which is capable of rendering the transfected cell line 
resistant to a normally toxic environment. Examples of 
such genes are adenosine deaminase (ADA) , aminoglycoside 
Phosphotransferase (neo) , dihydrof olate reductase (DHFR) 
hygromycin-B-phosphotransf erase (HPH) , thymidine kinase ' 
(tk) , xanthine-guanine phosphoribosyltransf erase (gpt) 
multiple drug resistance gene (mdr) , ornithine 

decarboxylase (ODC) and N-(phosphonacetyl) >L-aspartate 
resistance (CAD) . 

In addition to the positive selectable marker 
gene, an amplifiable gene is also optionally included in 
the construct at region D. Amplifiable genes are genes 
that lead to an increase in copy number when under 
selective pressure. The copy number of a gene positioned 
adjacent to the amplifiable gene will also increase 
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Amplif iable genes that can be utilized include DHFR, MDR, 
ODC, ADA and CAD. The members of the positive selectable 
marker gene group and those of the amplif iable gene group 
overlap so that, in theory, instead of using two genes, one 
5 for positive selection and one for amplification, one gene 
could be used for both purposes. However, since most cell 
lines contain endogenous copies of these amplif iable genes, 
the cells will already be somewhat resistant to the 
selection conditions and distinguishing the cells which 
10 have transfected DNA from those which do not receive 

transf acted DNA can be difficult. Thus, in instances where 
an amplif iable gene is desired, a positive selection gene 
which is dominant, such as HPH, gpt, neo and tk (in tk- 
cells) , should also be included in the construct. For some 
15 applications it may be possible or preferable to omit the 

amplif iable marker. For instance, the gene of interest may 
not need to be amplified as, for example, when 
transcriptional activation by the heterologous DNA 
regulatory sequence is sufficient without amplification. 
20 Also, if the homologous recombination efficiency is very 

low, it may be necessary to leave out the amplifiable gene 
since the ratio of non-homologous DNA to homologous DNA is 
directly related to the homologous recombination efficiency 
(Letsou, Genetics . 117:759-77*0,' 1987). It is also possible 
25 to eliminate the positive selection gene and select cells 
solely by screening for the production of the desired 
protein or mRNA. However, it is preferred in most cases to 
include at least the positive selection gene. 

Region E of the construct is a negative 
30 selectable marker gene. Such a gene is not expressed in 
cells in which the DNA construct is properly inserted by 
homologous recombination, but is expressed in cells in 
which the DNA construct is inserted improperly, such as by 
random integration. One such gene is the Herpes Simplex 
35 Virus thymidine kinase gene (HSVtk) . The HSVtk has a lower 
stringency for nucleotides and is able to phosphorylate 
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nucleotide analogs that normal mammalian cells are unable 
to phosphorylate. If the HSVtk is present in the cells, 
nucleotide analogs such as acyclovir and gancyclovir are 
phosphorylated and incorporated into the DNA of the host 
5 cell thus killing the cell. The presence of the negative- 
selectable marker gene enables one to use the positive- 
negative selection for homologous recombination as 
described by Mansour et al ( Nature . 336:348-352, 1988). 
Capecchi uses a strategy which takes advantage of the 
10 differing modes of integration that occur when linearized 

vector DNA inserts via homologous recombination as compared 
to when it inserts by random integration. If the vector 
DNA inserts randomly, the majority of the inserts will 
insert via the ends (Folger et al, Mol. Cell . Biol 
15 2:1372-1387, 1982; Roth et al, Mol. C^,n . nir.i 5:2599- 

2607, 1985; and Thomas et al. Cell , 44:419-428, 1986). on 
the other hand, if the vector inserts by homologous 
recombination, it will recombine through the regions of 
homology which cause the loss of sequences outside of those 
2 0 regions . 

Using the construct depicted in Figure i as an 
example, the mode of integration for homologous 
recombination versus random integration is illustrated in 
Figures 2A and 2B. In the case of non-homologous 
recombination (Figure 2A) , the vector is inserted via the 
ends of the construct. This allows region E, in this case 
the HSVtk gene, to be inserted into the genome. However, 
when homologous recombination occurs (Figure 2B) , the HSVtk 
gene is lost. The first round of selection uses the 
appropriate drug or conditions for the positive selection 
present within the construct. Cells which have DNA 
integrated either by homologous recombination or random 
integration will survive this round of selection. The 
surviving cells are then exposed to a drug such as 
gancyclovir which will kill all the cells that contain the 
HSVtk gene. In this case, most of the cells in which the 
vector integrated via a random insertion contain the HSVtk 
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gene and are killed by the drug while those in which the 
vector integrated by homologous recombination have lost the 
HSVtk gene and survive. This allows the elimination of 
most of the cells which contain randomly integrated DNA, 
5 leaving the majority of the surviving cells containing DNA 
which integrated via homologous recombination* This 
greatly facilitates identification of the correct 
recombination event . 

The negative selection step can also be 
10 eliminated if necessary. It will require that the 

screening step be more labor intensive involving the need 
for techniques such as polymerase chain reaction (PGR) or 
immunological screening . 

The sixth region (F) contains the DNA regulatory 
i5 segment that will be used to make the gene of interest 

transcriptionally active. The appropriate DNA regulatory 
segment is selected depending upon the cell type to be 
used. The regulatory segment preferably used is one which 
is known to promote expression of a given gene in 
20 differentiated host cell line. For example, if the host 
cell line consists of pituitary cells which naturally 
express proteins such as growth hormone and prolactin, the 
promoter for either of these genes can be used as DNA 
regulatory element F. When inserted in accordance with the 
25 present invention, the regulatory segment will be 

operatively linked to the normally transcriptionally silent 
gene of interest and will stimulate the transcription 
and/or expression of that gene in the host cell line. Also 
usable are promiscuous DNA regulatory segments that work 
30 across cell types, such as the rous sarcoma virus (RSV) 
promoter. As long as the regulatory segment stimulates 
transcription and/or expression, or can be induced to 
stimulate transcription and/or expression, of the gene of 
interest after being inserted into the host cell line so as 
35 to be operatively linked to the gene of interest by means 
of the present invention, it can be used in the present 
invention. It is important when joining the regulatory 
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segment F to the targeting segment A that no starting codon 
be accidentally introduced into the sequence since such an 
occxirrence could alter the reading frame of the gene which 
is desired to be expressed. Of course, the construct must 
be constructed and inserted such that the regulatory 
segment F is operatively linked to the gene of interest. 

The DNA regulatory segment, region F, need not be 
present in instances where it is desired to enhance or 
amplify the transcription of a gene which is already 
expressing in the cell line of interest, either because it 
naturally expresses in that cell line or because the cell 
line has previously had its DNA manipulated to cause such 
expression. In such instances, insertion of an amplifiable 
gene, region D, preferably with the positive selectable 
15 marker gene, region C, and optionally also with a negative 
selectable marker gene, region E, will be sufficient to 
increase the copy niimber of the gene of interest and thus 
enhance the overall amount of transcription. 
Alternatively, a new regulatory segment, region F, 
20 inherently promoting an increased (or otherwise modified) 
rate of transcription as compared to the existing 
regulatory region for the gene of interest, may be included 
to further enhance the transcription of the existing 
expressing gene of interest. Such a new regulatory segment 
25 could include promoters or enhancers which improve 
transcription efficiency. 

Regions C, D' and E* are promoter regions which 
are used to drive the genes in regions C, D, and E, 
respectively. These promoters are transcriptionally active 
in the cell line chosen and may be the same or different 
from the promoter in region F used to drive the endogenous 
gene of interest. The specific direction of transcription 
specified in Fig. 1 is not critical. Those of ordinary 
skill in this art can determine any appropriate placement 
35 of the genes C, D and E and their promoters C, and E* 
such that the promoters will stimulate expression of their 
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associated genes without simultaneously disrupting in any 
way the expression of the gene of interest or any of the 
other genes of the construct • 

The present invention may be illustrated by 
5 reference to the activation of the rat thyrotropin beta 

subunit (TSHB) in GH^ (ATCC CCL 82), GH3 (ATCC CCL 82,1) or 
GH4 CI cell lines (GH) . GH cell lines are derived from a 
radiation induced pituitary ttimor in rats designated MtT/W5 
(Takemoto, Cancer Res . , 22:917, 1962) and adapted to grow 

10 in culture by Tashjian et al. Endocrinology . 82:342-352, 
1968. These cell lines may be subcloned and screened for 
their ability to produce growth hormone and TSHB, Such 
screening may preferably be by means of Northern blot 
analysis to determine whether mRNA for the rat growth 

15 hormone gene is present and to establish that there is no 

mRNA for the TSHB gene being produced. The cell lines may 
also be screened by Southern analysis to determine that 
there is at least one copy of the TSHB gene present within 
the genome. Only the GH cell lines that produce growth 

20 hormone and not TSHB, but contain a copy of the TSHB gene, 
are used. 

The specific homologous recombination vector for 
use in GH cells may be designed in the following manner 
(Figure's). Region A may consist of the 5» upstream 

25 untranslated region of the TSHB' gene defined by the Hindlll 
fragment which stretches from -74 to -2785 and region B may 
contain the DNA fragment that stretches from the -27 85 
Hindlll site to a Ncol site approximately 2.1 kb further 
upstream as described by Carr et al ( J. Biol. Chem. . 

30 262:981-987, 1987) and Croyle et al (DNA, 5:299-304, 1986). 
The positive selection gene (region C) may be a 1067 bp 
Bglll-Smal fragment derived from the plasraid pSV2neo (ATCC 
No. 37,149) (Southern et al, J. Mol. AppI. Gen. , 1:327- 
341, 1982). The neo gene may be driven by the Rous 

35 Sarcoma Virus (RSV) promoter (region C') which is derived 
from the Ndel-Hindlll fragment from the plasmid pRSVcat 
(ATCC No. 37,152) (Gorman et al, PNAS , 79:6777-6781, 
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1982) . In this example, no amplifiable marker need be 
used and thus there need be no region b in order to 
optimize the efficiency of the homologous recombination. 
The efficiency is inversely related to the proportion of 
5 non-homologous to homologous sequences present in the 
construct (Letsou et al. Genetics, 117:759-770, 1987). 
Region E, or the negative selection gene, may consist of 
the HSVtk gene which is a 2 kb Xho fragment obtained from 
the plasmid pMCITK plasmid (Capecchi et al. Nature, 
10 336:348-352, 1988). The HSVtk gene in that construct may 
be driven by the polyoma virus promoter and enhancer 
(region E') as constructed by Thomas et al ( Cell , 51:503- 
512, 1987). In a second DNA construct the polyoma promoter 
may be replaced by the RSV promoter described above. The 
DNA regulatory sequence used to activate the TSHB gene may 
be either the RSV promoter or the rat growth hormone 
promoter. The rat growth hormone promoter consists of the 
SacI-EcoRI fragment obtained from the plasmid pRGH237CAT 
(Larson et al, PNAS, 83:8283-8287, 1986). The RSV promoter 
has the advantage of being usable in other cell lines 
besides GH cells, while the GH promoter is known to be 
active in GH cells and can be specifically induced (Brent 

al, J. Biol. Chem., 264:178- 182, 1989). The rat growth 
hormone promoter and the RSV promoter may be' inserted at 
2^ location F in separate constructs. 

Following transfection of the above construct 
into a GH cell line, the cells may be grown in media that 
contains G418. This will allow only those cells which have 
integrated plasmid DNA into the genome either by homologous 
recombination or random integration to grow. The surviving 
cells may be grown in media that contains gancyclovir. The 
majority of the cells that survive this round of selection 
will be those in which the vector plasmid DNA is integrated 
via homologous recombination. These cells may be screened 
to demonstrate that they are producing mRNA which 
corresponds to the TSHB gene and that they are producing 
the TSHB protein. The genomic DNA may also be sequenced 
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around the area of insertion of the heterologous promoter 
to insure that the proper recombination event occurred. 

EXAMPLE - Activation of TSHfi Gene in Rat Pituitary Cells 
5 Using the following protocol, thyrotropin beta 

subunit (TSHB) gene transcription, which normally does not 
occur in the rat GHj pituitary cell line, was activated in 
those cells by using the process of homologous 
recombination to target an activating element upstream of 

10 the TSH5 coding region. The Rous Sarcoma Virus (RSV) 
promoter is known -to function efficiently in GH3 cells 
(Christian Nelson et al. Nature, 322:557-562 (1986); Zheng- 
Sheng Ye et al, The Journal of Biological Chemistry . 
263:7821-7829 (1988)) and therefore was chosen as the 

1^ activating element. A plasmid vector was constructed which 
contained the RSV activating element, portions of the 5' 
flanking region of the TSHiS gene locus, and a selectable 
drug marker, aminoglycoside phosphotransferase gene (NEO) , 
for the isolation of transfected cell populations. 

20 Ribonucleic acid (RNA) was extracted from pooled drug 
resistant GH3 cell populations and converted to 
complementary deoxyribonucleic acid (cDNA) . The cDNA was 
then screened by the technique of polymerase chain reaction 
(PCR) for the presence of TSHB cDNA. The constuction of 
25 the homologous recombination vectors and the control 
vectors is outlined below along with the experimental 
procedures and results. 

PLASMID CONSTRUCTION 
30 Homologous Recombination fHR) Backbone Vector 

fpRSVCATNEO) . 

The Rous Sarcoma Virus (RSV) promoter was derived 

from the plasmid pRSVCAT (Cornelia M. Gorman et al.. 

Proceedings of the National Academy of Science . 79:6777- 
35 6781 (1982)) (figure 5) by isolating the 580 base pair (bp) 

Ndel - Hindlll fragment containing the functional promoter 
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unit. The ends of this fragment were blunted using DNA 
polymerase I Klenow fragment and Xbal linkers ligated to 
the blunt ends. After digestion with Xbal restriction 
endonuclease and gel purification, the resulting fragment 
5 was ligated into the Xbal site of pUClS . A bacterial 
colony harboring a plasmid with the RSV insert in the 
orientation shown in figure 6 was designated pRSV. The 
aminoglycoside phosphotransferase gene (NEO) was cloned 
from PSV2NEO (P.J. southern et al.. Journal of MoleculaT- 
and Applied Genetics, 1:327-341 (1982)) by isolating the 
Bglll and BamHI fragment (figure 7) and ligating that 
fragment into the BamHI site of pRSV (figure 6)... A plasmid 
containing the NEO gene in the orientation shown in 
figure 8 was picked and designated pRSVNEOBAM. pRSVNEOBAM 
was digested with Smal and the 4328 bp fragment containing 
the RSV promoter region, the majority of the NEO gene and 
pUClS was isolated by gel electrophoresis. The Smal ends 
of this fragment were Xhol linkered, cleaved with Xhol 
restriction enzyme and the plasmid recircularized by 
20 ligation. The resulting plasmid is shown in figure 9 and 
is called pRSVNEO. This last cloning step resulted in the 
deletion of a 786 bp fragment from the 3* end of the NEO 
fragment which is not necessary for its functional 
expression. This construction yields a plasmid in which 
the NEO gene is transcriptionally driven by the RSV 
promoter . 

Next the Ndel site located 5' of the RSV promoter 
in pRSVCAT (figure 5) was converted to a Sail site. This 
was accomplished by digesting pRSVCAT with Ndel, filling in 
the ends using DNA polymerase I Klenow fragment and 
ligating Sail linkers to the resulting blunt ends. The 
linkers were digested to completion with Sail and the 
plasmid recircularized by ligation. Into the newly 
constructed Sail site was cloned the Sail - Xhol fragment 
from PRSVNEO (figure 9) containing the RSV promoter and the 
NEO gene. A plasmid with the RSV promoter and NEO fragment 
oriented as shown in figure lo was isolated and named 
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pRSVCATNEO. This plasmid when transfected into GH3 cells 
was capable of conferring G418 resistance to those cells, 
demonstrating the ability of the RSV promoter to drive 
transcription of the NEO gene and the ability of that RNA 
5 to be translated into a functional protein (data not 

shown) • Total RNA from the stable transf ectants above was 
analyzed by polymerase chain reaction (PGR) to determine 
whether the CAT gene was being transcribed. PGR results 
showed that the GAT gene was indeed being transcribed in 

10 all the G418 resistant colonies tested (data not shown) , 
indicating that the RSV promoter 5 ' of the GAT gene was 
qapable of driving transcription of a gene located 3 ' to 
it. This is important because this RSV promoter will be 
responsible for driving transcription of the TSH6 gene when 

15 the TSH6 HR vector described below integrates via 
homologous recombination into the GH3 genome. 

TSHB HR Vector 

A vector capable of integrating into the GH3 
2 0 genome by homologous recombination was created by inserting 
two stretches of the 5* flanking regions of the thyrotropin 
beta subunit (TSHB) gene into the unique Sail and Hindlll 
sites contained in pRSVGATNEO (figure 10) . . A rat spleen 
genomic library containing inserts of 15 kilobases (kb) or 

2 5 greater cloned into lambda DASH was obtained from 

Stratagene^ San Diego, GA. Using standard protocols 
( Current Protocols in Molecular Bioloov . pp. 1.9.1 - 1.13.6, 
6.1.1 - 6.4.10) a 15.3 kb clone of the rat genomic TSHB 
gene including 9kb of sequence 5 » of the first exon was 

3 0 isolated. The 15.3 kb fragment consisted of two Xbal 

fragments, a 10.6 kb fragment corresponding to the 5' end 
of the 15.3 kb fragment and a 4.7 kb piece corresponding to 
the 3* region of the 15.3 kb fragment (figure 11). Both of 
these Xbal fragments were subcloned into pOC18 and plasmids 
3 5 containing inserts in both orientations were isolated. The 
2.3 kb Xbal - Hindlll fragment contained in the 4.7 kb Xbal 
fragment (figure 11) was purified and the Xbal site of this 
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fragment was converted to a Hindlll site by filling in the 
ends with Klenow fragment and ligating on Hindlll linkers. 
This fragment was ligated into the unique Hindlll site 
contained in pRSVCATNEO (figure 10) . An isolate 
5 corresponding to a plasmid with the 2.3 kb insert in the 
correct orientation as shown in figure 12 was assigned the 
name pRSVCATNEOTSHBS . 

The subcloned 10.6 kb Xbal fragment from the rat 
TSHB clone (figure 11) was isolated and the Xbal ends 
converted to Sail sites by blunt ending the fragment with 
DNA polymerase I Klenow fragment and attaching sail 
linkers. This 10.6 kb Sail fragment was then cloned into 
the Sail site of pRSVCATNEOTSHBS (figure 12) . A plasmid 
containing the insert in the correct orientation was 
identified and named pRSVCATNE0TSHB3-5XbaI (figure 13) . 
The latter plasmid has been deposited in the American Type 
Culture Collection, Rockville, MD, and has received 
depository number ATCC 40933. For the purpose of this 
deposit, the plasmid was renamed pHRTSH. This deposit was 
made in accordance with all of the requirements of the 
Budapest Treaty. 
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CELL LINE 

GHj cells are a subcloned population " of MtT/W5 
which was derived from a radiation induced pituitary tumor 
in rats (B.K. Takemoto, Cancer p^^^^^r-K, 22:917 (1962)) 
and adapted to growth in culture by Tashjian et al.. 
Endocrinology , 82:342-352 (1968). The GH3 cells were 
obtained from the American Type Culture Collection cell 
bank and are maintained in culture by growth in Dulbecco's 
Modified Eagle's Medium (DMEM) + 15% horse serum (HS) + 
2.5% fetal bovine serum (FBS) + 1% L-glutamine (GH, media) 
at 37 »C in 5% CO2 . 
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DNA PREPARATION 

Large-Scale Preparation of Plasmid DNA 

All plasmids used for stable transf ections were 
purified by using the alkaline lysis method for large-scale 
5 plasmid DNA purification as described in Current Protocols 
in Molecular Biology > vol. 1, pp. 1.7.1 - 1.7.2. DNA 
isolated by the alkaline lysis method was further purified 
by double banding in a cesium chloride gradient as also 
described in Current Protocols in Molecular Biology , vol. 
10 1, pp. 1.7.5 - 1.7.7. 

Prior to transf ect ion, the HR vectors were 
digested with either Aatll or Apal. Apal was used to 
linearize the control plasmid pRSVCATNEO and Aatll to 
linearize the HR plasmid pRSVCATNEOTSHB3-5XbaI . The 
15 location of the cleavage sites of Apal and Aatll can be 
seen in figures 10 and 13 respectively. After digestion 
with the appropriate restriction enzyme, the reaction was 
phenol /chloroform extracted, chloroform extracted, ethanol 
precipitated, and washed once with 70% ethanol. The 
20 plasmids were then resuspended in sterile deionized water 

(dH2 0) to a concentration of 1 microgram/microliter (/xg/Ail) 
as determined by absorbance at OD250 • ^^i attempt to 

increase the transfection efficiency and/or the ratio of 
homologous recombination positives to those that" were due 
25 to random integration, pRSVCATNEOTSHB3-5XbaI was digested 
with Apal . Digestion with Apal cuts at three separate 
sites in pRSVCATNEOTSHB3-5XbaI and removes all regions of 
the vector except those necessary for homologous 
recombination (figure 13). After digestion with Apal, the 
3 0 reaction was electrophoresed on a 0.8% agarose gel and the 
top band corresponding to the 10,992 bp fragment containing 
the two 5' flanking regions of the TSH6 gene, the RSV 
promoter - NEO region and the TSHB gene-activating RSV 
promoter was isolated from the gel by electroelution into 
35 dialysis tubing. The electroeluted DNA was further 

purified by using an elutip minicolumn (Schleicher and 
Schuell) with the manufacturer's recommended standard 
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protocol. The DNA was eluted from the colrmm, ethanol 
precipitated, washed with 70% ethanol and resuspended to a 
concentration of 1 fig/iil. 

STABLE TRANSFECTIONS 
Calcium Phosphate Transfection 

48 hours prior to transfection 3 x 10* GH5 cells 
were plated on 10 centimeter (cm) dishes. For each dish, 
10 ng of vector DNA along with 30 ^lg of sonicated salmon 
sperm DNA was added to 0.5 milliliters (ml) of tranfection 
buffer. The transfection buffer was prepared by combining 
4g NaCl , 0 . 185g Kcl , 0 . 05g Naj HPO4 , 0 . 5g dextrose, 2 , 5g 
HEPES and dHj O to a final volume of 500 ml and bringing the 
pH to 7.5. 31 Ml of 2 molar (M) CaClj was added to the 0.5 
15 ml of DNA + transfection buffer and vortexed. This 

solution was allowed to stand at room temperature for 45 
minutes. When the DNA - CaCls - transfection buffer was 
ready, the GH3 medium was removed from the GH3 cells and 
the DNA - CaCl2 - transfection buffer was layered over the 
20 cells. The cells were allowed to stand at room temperatiare 
for 20 minutes. After 20 minutes, 5 ml of GH3 medixim was 
added and the plates were incubated at 37 »C for 6 hotirs. 
The cells were then shocked by aspirating off the medium 
and adding 5 ml of fresh transfection buffer containing 15% 
25 glycerol for 3.5 minutes. The cells were rinsed 2x with 
PBS and fed with 10 ml of GHj medixam. 48 hours post- 
transfection, the medi\am was removed and 10 ml of GH3 
medium containing 400 fig/ml G418 was added. 

30 Electroporation 

El ectr operation was carried out using a BTX 300 
Transfector with 3.5 millimeter (mm) gap electrodes. 1 x 
107 GH3 cells growing in log phase were removed from their 
plates by trypsinization, pelleted by centrifugation and 

35 washed once with PBS. Cells were resuspended in 1.0 ml of 
PBS and transferred to 2.9 ml Ultra-UV disposable cuvettes 
(American Scientific Products) on ice. 10 ng of DNA was 
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added to the cells, mixed and placed back on ice for 5 
minutes. After 5 minutes the electrodes were placed in the 
chamber and the cells were electroporated at a setting of 
750 microfarads with a 200 volt pulse. The cuvette was 
5 then returned to ice for 10 minutes. Cells were 
transferred from the cuvette to 9 ml of GH3 medium 
containing 1% penicillin and 1% streptomycin at room 
temperature in a 15 ml conical tube and allowed to stand 
for 10 minutes. The total electroporation of 1 x 10^ cells 
10 was transferred to three 10 cm plates giving approximately 
3 X 10^ cells per plate. After 4 8 hours, the GH3 medium 
containing 400 /xg/ml G418 was added. 

Transfection of GH3 cells with pRSVCATNEOTSHB3-5Xbal (Aatll 
cut) . pRSVCATNEOTSHB3-5XbaI fAoal cut) and pRSVCATNEO fApal 
cut) 

pRSVCATNEOTSHB3-5XbaI (Aatll cut) , 
pRSVCATNEOTSHB3^5XbaI (Apal cut) and pRSVCATNEO (Apal cut) 
plasmids were transfected into GH3 cells along with a no 
DNA control using both the calcium phosphate protocol and 
2^ the electroporation protocol. 48 hours after transfection, 
the cells were put under G418 selection. Approximately 14 
to 21 days later the colonies became visible by eye on the 
10 cm dishes and were counted. In all of the no DNA 
controls, there were no visible colonies, demonstrating 
that the G418 selection was working and that the presence 
of a plasmid containing the RSV - NEO region was necessary 
to confer G418 resistance. At this time, colonies were 
picked and pooled by isolating regions on the 10 cm dish 
with 17 millimeter wide cloning rings. These large cloning 
rings encompassed between 10 and 70 colonies depending on 
the density of the colonies per plate and allowed the GH3 
cells in that isolated region to be removed and pooled at 
the same time by trypsination. The trypsinized colonies in 
each ring were transferred to 6 well plates and allowed to 
grow in GH3 media containing G418. After reaching 70% to 
80% confluence, 80,000 cells were transferred to a 24 well 
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plate and the remaining cells cryopreserved for further 
testing at a later date. The cells in "the 24 well plates 
were grown until they reached 50% to 80% confluence. Total 
RNA was then harvested from these GH3 cells by the 
5 following procedure. 

RNA ISOLATION FROM TRANSFECTED GH3 CELLS GROWN IN 24 WELL 
PLATES 

The following is a modification of the protocol 
10 described by Chomczynski and Sacchi, Anal. Biochem. . 

162:156-159 (1987). The media covering the GH3 cells in 
the 24 well plates was removed and the cells washed with 1 
ml of PBS. 1 ml of GTC solution was added and the cells 
were incubated at room temperature for 5 minutes. GTC 
15 solution was prepared by dissolving 250 g of guanidium 
thiocyanate (Fluka) in 293 ml of dHjO, and then adding 
17.6 ml of 0.75 M Na citrate pH 7.0 and 26.4 ml of 10% 
sarcosyl (L-Lauryl sarcosine) . Just prior to use, 360 ^1 
of 6-mercaptoethanol per 50 ml GTC solution was added. 
20 After 5 minutes at room temperature, the 1 ml of GTC-cell 
lysate was transferred to a Sarstedt 55.518 snap-cap tube 
containing 2 ml of GTC solution. To each tube was added 
300 Hi of 2M sodium acetate pH 4.0 and the tube vortexed. 
Next, 3 ml of dHjO saturated phenol was added and the tubes 
25 were vortexed again. To each tube was added 600 ^1 of 

chloroform :isoamyl alcohol (49:1) and the tube was shaken 
by hand for 10 seconds and placed on ice for 15 minutes. 
The tubes were then centrifuged in a Sorval RC-5B using a 
SM24 rotor at 8000 revolutions per minute (RPM) for 20 
30 minutes at 4<»C. The aqueous phase was transferred to a 
fresh Sarstedt tube containing 3 ml of isopropanol and 
placed at -20 "C for 1 hour. After 1 hour the tubes were 
spun in a Sorval RC-5B using a SM24 rotor at 8000 rpm for 
20 minutes at 4''C. The supernatants were removed and the 
35 pellets resuspended in 500 jul of GTC solution. The 

resuspended RNA was transferred to a 1.5 ml eppendorf tube 
to which 500 ^ll of isopropanol was added. The tubes were 
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once again placed at -20**C for 1 hour. The eppendorf tubes 
were spun for 5 minutes in a microfuge and the supernatant 
discarded. The pellet was washed with 70% ethanol 2 times 
and allowed to dry until the ethanol had completely 
5 evaporated. The pellet was resuspended in 2 0 /il of diethyl 
pyrocarbonate (depc) treated water and heated to SS^'C for 5 
minutes. This RNA was then used to make cDNA in one of the 
two procedures described below. 

10 CDNA REACTIONS 
Method 1 

First strand cDNA was synthesized from 2.5-6.0 
microliters of total RNA (approximately 0.5-6 micrograms) 
in a reaction volume of 10-20 microliters. The total RNA 

15 was obtained by the extraction method described above, and 
was denatured for 5-10 minutes at IQ'^C and quick chilled on 
ice before adding the reaction components. The reaction 
conditions were 50 millimolar (mM) Tris-HCl (pH 8.3), 10 mM 
MgCl2 , 10 mM DTT, 0.5 mM each of dCTP, dATP, dGTP, and dTTP 

20 (Pharmacia) , 40 mM KCl, 500 units/ml RNasin (Promega 

Biotech), 85 fig/ml oligo (dT) ^ 2 - 1 s (Collaborative Research, 
Inc.), and 15,000-20,000 units/ml Moloney murine leukemia 
virus reverse transcriptase (Bethesda Research 
Laboratories) incubated at 37 for 60 minutes. The 

25 reaction was terminated by the addition of EDTA to 4 0 mM, 
and the nucleic acid was precipitated by adding sodium 
acetate to a concentration of 0.3 M and two volumes of 
ethanol. The precipitate was allowed to form at 0*^0 for 3 0 
minutes and was pelleted by centrifugation in a microfuge 
30 at 14,000 rpm for thirty minutes. The pellet was washed 
with 70% ethanol, dried, and resuspended in depc treated 
water to a volume of 15-25 microliters. 

Method 2 

35 Conditions for first strand synthesis of cDNA 

from RNA were adapted from Carol A. Brenner et al, 
BioTechniaues , Vol. 7, No. 10, pp. 1096-1103 (1989). 1 ^ll 
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Of total RNA from the RNA prep procedure described above 
was added to 9 /il of reaction buffer in a 0*5 ml eppendorf 
tube. The reaction buffer consisted of 200 units of 
Moloney murine leukemia virus reverse transcriptase (MMLVRT 
5 Bethesda Rese'sarch Labs) , and a final concentration of the 
following reagents: 70 mM Tris.HCl pH 8.8, 40 mM KCl, 0.1% 
Triton X-100, 1 mM of each dNTP, 4 mM MgClj , and 0.45 ODj^o 
units of random hexamers (Pharmacia) . After mixing, the 
tubes were incubated at room temperature for 10 minutes and 
10 then placed at 42**C for 1 hour. After 1 hour the tubes 

were heated to 90**C for 1 minute to deactivate the MMLVRT 
and then cooled to room temperature. 

POLYMERASE CHAIN REACTION (PCR) AMPLIFICATION OF RNA FROM 
^5 GH3 CELLS 

The following primers were used to amplify, by 
PCR, TSHfi cDNA synthesized from RNA transcripts produced by 
the GH3 cells as a result of the HR plasmids activating the 
endogenous TSHB gene by homologous recombination. 

2 0 primer 5' 3« 

TSHB 5 AGTATATGATGTACGTGGACAGG 
TSHB 3 CACTTGCCACACTTGCAGCTCAGG 

Figure 14 shows the regions of the TSHB gene to 
25 which each primer corresponds. 

PCR REACTION CONDITIONS 

All PCR reactions were performed in the Ericomp 
Twinblock thermocycler . If PCR amplification was to be run 

30 on cDNA made by method 2, 40 /il of additional reaction mix 
was directly added to the 10 ^1 of the cDNA reaction 
bringing the total volume up to 50 Atl. The final 
concentrations of reagents in the 50 ^^l were 70 mM Tris.HCl 
pH 8.8, 40 mM KCl, 0*1% Triton X- 100, 2.25 units Tag 

35 polymerase (Pharmacia), 0.2 micromolar (fiK) each primer, 
200 ^M each dNTP, and 0.8 mM MgCl2 . 
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If PGR was to be performed on cDNA made by method 
1 above, 5 to 10 /xl of the resuspended cDNA was added to 40 
to 45 /xl containing final concentrations of the following: 
70 mM Tris.HCl pH 8.8, 40 mM KCl, 0.1% Triton X-100, 2.25 
5 units Tag polymerase, 0.2 each primer, 200 /xM each dNTP, 
and 0.8 mM MgCl2 . 

The reactions were then subjected to the 
following PGR cycles. 

1 minute at 94*'G. 
10 3 0 seconds at 55 ^'G. 

2 minutes at 72**G. 

The above cycle was repeated 30 to 4 0 times. 10 
Ml of each reaction mix was run on a 6% polyacrylamide gel 
and screened for the presence of a 247 bp PGR fragment 
15 which would indicate the presence of the properly spliced 
mRNA for TSHB. 

PGR RESULTS FOR AMPLIFIGATION OF TSH6 RNA FROM GH3 GELLS 
AND RAT PITUITARY GLAND TOTAL RNA 

To determine whether GH3 cells normally 

20 

synthesize TSHB RNA, cDNA from untransf ected GH3 cells as 
well as cDNA from rat pituitary glands was subjected to the 
above PGR reaction conditions. The correct 247 bp band 
indicative of the presence of TSHB mRNA was visible in the 
positive control of the rat pituitary gland sample but no 
band was visualized from the GH3 cell total RNA sample even 
after 60 cycles (data not shown) . 

TRANSFEGTION RESULTS 

The number of G418 resistant colonies present on 
the 10 cm dishes were tabulated between 14 and 21 days 
after addition of G418 to the media. 



35 
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Transf ec-tion Method 



Calcium phosphate 1 
Calcium phosphate 2 
Electroporation 1 
Electroporation 2 



Colonies per 10 cm dish 
PRSVCATNEO PRSVCATNEOTSHB 3 - SXR A 1 

Apal cut Aat2 cut 



48 



13 
21 
1295 
1051 



29 
58 
415 
723 



10 



15 



20 



Total RNA was harvested from the colony pools 
contained in the 24 well plates as described above. cDNA 
was made from these RNA preps and subjected to PCR 
amplification. The nvimber of positive colonies producing 
TSH6 mRNA was determined by the presence of a 247 bp 
fragment as visualized on a polyacrylamide gel. Each of 
the pools screened contained between 10 and 70 colonies. 
The estimated number of colonies per pool per transfection 
was used to approximate the number of G418 resistant GH3 
cell clones in which TSHB gene transcription was activated. 
If a pool tested positive, it was assumed that this 
represented one positive colony present in that particular 
pool . 



plasmid G418 resi stant colonipc; TSHB RNA positive 

25 pRSVCATNEO 60 0 

PRSVCATNEOTSHB3-5XBA1 4942 3 

(Aat2 digested) 

PRSVCATNEOTSHB3-5XBA1 8580 6 

(Apal digested) 

30 These results demonstrate the successful 

activation of the normally transcriptionally silent TSHB 
gene by the method of the present invention. While the 
number of colonies that are positive for TSHB transcription 
is small compared to the nximber of colonies that are G418 

35 resistant (approximately one out of every 10^ G418 
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resistant colonies) , this result is generally consistent 
with rates reported for other homologous recombination 
experiments (Michael Kriegler, Gene Transfer and Expression 
A Laboratory Manual . Stockton Press, New York, NY (1990), 
5 pp. 56 - 60)'. It has been generally observed that the 
homologous recombination rate seems to be proportional to 
the rate of transcription of the targeted gene (M. Frohman 
and G. Martin, Cell . 56:145 (1989); S. L. Mansour et al, 
Nature . 336:348 (1988)). It should be noted that the rate 

10 which has been demonstrated is three orders of magnitude 
higher than what might be expected for random mutation 
turning on the TSHB gene. 

To ensure that the results for each colony pool 
were reproducible and that the activation of RNA 

15 transcription was stable, colony pools previously frozen 
away corresponding to pools which tested positive in the 
first screening were thawed and expanded in culture. The 
freshly thawed GH3 positive pools were seeded in T 25 
tissue culture flasks and expanded until the cells reached 

20 70% to 80% confluence. 80,000 cells were then plated in 24 
well plates from each flask and grown until they were 50% 
to 70% confluent. RNA was extracted from the cells, 
converted into cDNA, and screened once again for the 
presence of TSHB RNA by running 10 ^1 of each PGR reaction 

25 on a 6% polyacrylamide gel. Figure 15 shows the results 
of representative PGR reactions from the second screening 
as visualized on a polyacrylamide gel by ethidium bromide 
staining and fluorescence. Lanes 1, 2, and 3 contain the 
PGR reactions run on cDNA from GH3 cells which had been 

30 transfected by pRSVCATNEO. pRSVCATNEO contains no regions 
of homology to TSHB and thus is not capable of activating 
the TSHB gene by homologous recombination. As can be seen 
on the gel in figure 15, there are no bands corresponding 
to 247 bp in those lanes indicating that the TSHB gene is 

35 not activated. Lane 6 also contains a negative control. 

In that lane three pools were combined from samples of GH3 
cells which had been transfected with pRSVGATNEOTSHB3-5XbaI 
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(Apal cut) but which were negative for transcription of the 
TSHB gene on the first screening. The absence of the 247 
bp fragment in lane 6 demonstrates that the presence of the 
transfected pRSVCATNEOTSHB3-5XbaI (Apal cut) plasmid 
integrated randomly in the genome is not capable of 
producing the 247 bp TSHB PGR fragment. Lanes 7, 8, 9, and 
10 include PGR reactions run on cDNA made from total rna 
harvested from rat pituitary glands in quantities per 
reaction of 25 nanograms, 100 nanograms, 200 nanograms, and 
400 nanograms, respectively. The presence in' these lanes 
of the expected 247 bp band, produced from cDNA prepared 
from a rat tissue which normally expresses TSHB, showed 
that the PGR reaction conditions were correctly 
optimized and that the PGR band obtained in lanes 4 and 5 
containing the homologous recombination TSHB positives is 
of the correct size. Two pools transfected with 
pRSVCATNEOTSHB3-5XbaI (Apal cut) which were positive in the 
first screening, ApaI-107 in lane 4 and ApaI-136 in lane 5, 
once again tested positive for TSHB gene activation as 
demonstrated by the presence of the correct TSHB PGR band 
amplified from cDNA made from the total RNA extracts from 
those pools proving that transcription of TSHB gene has 
been stably activated. The presence of bands at 247 bp in 
lanes 4 and 5 containing RNA from previous positives ApaI- 
107 and ApaI-13 6 and the absence of bands in the negative 
controls of pRSVGATNEO transfected GH3 cells in lanes 1-3 
and the pRSVGATNEOTSHB3-5XbaI (Apal cut) negatives in lane 
6 demonstrated that the production of TSHB RNA in a cell 
line that does not nomally produce that RNA has been 
stably turned on by homologous recombination. 

The present invention is not limited to the cell 
line that is described herein. All cell lines have genetic 
information which is normally silent or inert. Most are 
able to express only certain genes. However, a normally 
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transcriptionally silent or inert gene of any such cell 
line can be activated to express the gene product in 
accordance with the present invention and any gene of the 
genome may have its expression characteristics modified in 
5 accordance with the present invention. Even previously 

transformed cell lines can be used as long as the previous 
transformation did not disrupt the gene of interest. The 
source of the cell line is not important. The cell line 
may be animal or plant, primary, continuous or immortal. 
10 Of course, it is desirable that any such cell line be 
stable and immortal, so that after treatment with the 
technique in accordance with the present invention, 
expression can be commercialized. Cloned microorganisms, 
whether prokaryotic or eukaryotic, may also be treated by 
15 the technique of the present invention. 

While the present invention has been preferably 
described with respect to the expression of a normally 
transcriptionally silent or inert gene, the technique of 
the present invention is also applicable to the 
20 modification of the expression characteristics of a gene 
which is naturally expressed in the host cell line. For 
example, if it is desired to render the expression of a 
gene dependent upon culture conditions or the like so that 
expression can be turned on and off at will, an appropriate 
25 DNA regulatory segment, such as a regulatable promoter, can 
be inserted which imparts such characteristics, such as 
repress ibility or inducibility . For example, if it is 
known that the cell type contains nuclear steroid 
receptors, such as estrogen, testosterone or 
3 0 glucocorticoid, or thyroxin receptors, one could use the 

steroid or thyroxin response elements as region F. Such a 
response element is any DNA which binds such receptor to 
elicit a positive response relative to transcription. Even 
if a cell is not naturally responsive to glucocorticoids, 
35 for example, a piece of DNA which encodes the 

glucocorticoid receptor could be added to the construct, or 
othearwise inserted somewhere in the genome, so as to make 
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the cell responsive to glucocorticoids. The use of a 
regulatable promoter could be desirable whether or not the 
gene of interest is normally transcriptionally silent. 
Other kinds of regulation can also be obtained by targeting 
the appropriate DNA regulatory segment to the exact 
position of interest by means of the process of the present 
invention. 

Thus, while stimulation of expression of normally 
transcriptionally silent genes is the preferred application 
of the present invention, in its broadest sense it is 
applicable to the modification of expression 
characteristics of any gene endogenous to the host cell 
line. 

The specific technique of homologous 
recombination is not, per se, a novel part of the present 
invention. Such techniques are known and those of ordinary 
skill in this art will understand that any such technique 
can be used in the present invention as long as it permits 
targeting of the DNA regulatory sequence to the desired 
location with respect to the gene of interest. While a 
preferred technique is disclosed, using a linearized 
construct with two homologous regions on either end of the 
sequences to be inserted, any other technique which will 
accomplish this f\inction, as, for example, by using 
circular constructs, is also intended to be comprehended by 
the present invention. The critical feature of the present 
invention is the use of homologous recombination techniques 
to insert a DNA regulatory sequence which causes 



W6 91/09955 



PCr/US90/07642 



modification of expression characteristics in the cell line 
or microorganism being used, operatively linked with a gene 
in the genome of the cell line, preferably one which is 
normally transcriptionally silent, or to insert an 
5 amplifiable sequence, without a regulatory sequence, 

sufficiently near a gene in the genome of the cell line 
which already transcribes as to cause amplification of such 
gene upon amplification of the amplifiable sequence. It is 
not absolutely necessary that a selectable marker also be 

10 included • Selection can be based solely on detection of 

the gene product of interest or mRNAs in the media or cells 
following insertion of the DNA construct. Furthermore, in 
the embodiment in which a regulatory sequence is being 
inserted, amplification, while desired, is not critical for 

15 operability. The same is true for the negative selection 
gene which makes the screening process easier, but is not 
critical for the success of the invention. Thus, the basic 
embodiment requires only insertion of the DNA regulatory 
segment or the amplifiable segment in the specific position 

20 desired. However, the addition of positive and/ or negative 
selectable marker genes for use in the selection technique 
is preferred, as is the addition of an amplifiable gene in 
the embodiment in which a regulatory segment is being 
added. 

25 The term "modification of expression" as used 

throughout the present specification and claims, is hereby 
defined as excluding termination of expression by inserting 
by homologous recombination a mutation, deletion, stop 
codon, or other nucleotide sequence, including an entire 

30 gene, into the gene of interest, so as to prevent the 

product of interest from being expressed. The prior art 
teaches the use of homologous recombination to insert 
specific mutations and the expression of a cell product may 
have inherently been terminated by means thereof (see, for 

35 example, Schwartzberg et al, PNAS (USA), 87:3210-3214 
(1990)). The present invention is not intended to 
encompass such a procedure. In the present invention the 
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"modification of expression" is accomplished by means of 
inserting regulatory and/or amplification regions at a 
specific desired location by means of homologous 
recombination. The preferred modifications are those which 
activate and/ or enhance expression of the product of 
interest . 

Whenever the present specification uses the 
phrase that a DNA regulatory segment is "operatively linked 
with" a gene, such terminology is intended to mean that the 
DNA regulatory segment is so disposed with respect to the 
gene of interest that transcription of such gene is 
regulated by that DNA regulatory segment. The regulatory 
segment is preferably upstream of the gene, but may be 
downstream or within the gene, provided that it operates to 
15 regulate expression of the gene in some way. The DNA 

regulatory segment may be a promoter, terminator, operator, 
enhancer, silencer, attenuator, or the like, or any 
combination thereof. 

Whenever the terms "upstream" or "downstream" are 
20 used in the present specification and claims, this is 

intended to mean. in the 5 '-direction or the 3 • -direction, 
respectively, relative to the coding strand of the gene of 
interest . 

The foregoing description of the specific 
25 embodiments so fully reveals the general nature of the 

invention that others can readily modify and/or adapt such 
specific embodiments for various applications without 
departing from the generic concept. Any such adaptations 
and modifications are intended to be embraced within the 
meaning and range of equivalents of the disclosed 
embodiments. It is to be understood that the phraseology 
and terminology employed herein are for the purpose of 
description and not of limitation. 



30 



35 
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WHAT IS CLAIMED IS: 

1. A method of activating a normally 
transcriptionally silent gene within the genome of a cell 
line or microorganism so as to enable said cell line or 

5 microorganism to express the gene product of said gene, 
comprising inserting a DNA construct into said genome by 
homologous recombination, said DNA construct comprising a 
DNA regulatory segment capable of stimulating expression of 
said gene when operatively linked thereto and a DNA 
10 targeting segment homologous to a region of said genome 

within or proximal to said gene, wherein said construct is 
inserted such that said regulatory segment is operatively 
linked to said gene of interest • 

2. A method in accordance with claim 1, 21, or 
15 22, wherein said DNA construct comprises two DNA targeting 

segments, each homologous to a region of said genome within 
or proximate to said gene, one of said targeting segments 
being upstream of said regulatory segment and the other of 
said targeting segments being downstream of said regulatory 
20 segment, 

3. A method in accordance with claim 1, 2, or 
21, wherein said DNA construct additionally comprises at 
least one expressible selectable marker gene disposed so as 
to be inserted with said regulatory segment. 

25 4. A method in accordance with claim 1, 2, 3, 

21, 22, or 23, wherein said DNA construct additionally 
comprises a negative selectable marker gene disposed with 
respect to said targeting segment so as not to be inserted 
when said construct is properly inserted by homologous 

3 0 recombination, whereby said negative selectable marker is 
not expressed in cells in which said DNA construct is 
properly inserted. 

5. A method in accordance with claim 1, 2, 3, 4, 
or 21, wherein said DNA construct additionally comprises ah 

35 expressible amplifiable gene disposed so as to be inserted 
with said regulatory segment. 
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6, A method accordance with claim 1^ 2, 3, 4, 
5, 21, 22, or 23, wherein said cell line or microorganism 
is a eukaryotic cell line, 

? • A method in accordance with claim 6, wherein 
said cell line or microorganism is an animal cell line. 

8. A method in accordance with claim 6, wherein 
said cell line or microorganism is a mammalian cell line, 

9. A method in accordance with claim 6, wherein 
said cell line or microorganism is a plant cell line, 

10. A method in accordance with claim 2, and 
additionally for causing expression of said gene product, 
further including the steps of,, following said inserting 
step : 

selecting clones of said cell line or 
microorganism which express the product of said selectable 
marker gene; 

cultivating the selected clones under conditions 
sufficient to permit expression of said gene product; and 
collecting said gene product. 

11. A method in accordance with claim 10, 
wherein said selectable marker gene is the neomycin 
resistance gene and said selecting step comprises selecting 
those clones having neomycin resistance. 

12. A method in accordance with claim 10 or 11, 
wherein said DNA construct additionally comprises a 
negative selectable marker gene disposed with respect to 
said targeting segment so as not to be inserted when said 
construct is properly inserted by homologous recombination, 
whereby said negative selectable marker is not expressed in 
cells in which said DNA construct is properly inserted, and 
said selecting step further includes selecting those clones 
which do not express said negative selectable marker gene. 

13. A method in accordance with claim 12, 
wherein said negative selectable marker gene is the Herpes 
Simplex Virus thymidine kinase gene and said selecting step 
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includes selecting those clones which survive exposure to a 
media that kills cells which express said gene* 

14. A genome having a DNA regulatory segment 
operatively linked with a naturally occurring gene at an 

5 insertion site characterized by a predetermined DNA 

sequence, said DNA regulatory segment not being naturally 
occurring at said location in the genome. 

15. A cell line or microorganism capable of 
expressing 4 gene product by a normally transcriptionally 

10 silent gene within the genome of said cell line or 

microorganism, said genome having inserted therein a DNA 
regulatory segment operatively linked with said normally 
transcriptionally silent gene, said DNA regulatory segment 
being capable of promoting the expression of a gene product 

15 by said cell line or microorganism. 

16. A cell line or microorganism in accordance 
with claim ^5 or 25, wherein said DNA regulatory segment is 
one which is capable of promoting the expression of a gene 
product normally expressed by said cell line or 

2 0 microorganism . 

17. A cell line or microorganism in accordance 
with claim 16, wherein the inserted DNA regulatory segment 
is part of a DNA construct comprising said DNA regulatory 
segment and at least one selectable marker gene. 

25 18. A cell line or microorganism in accordance 

with claim 17, wherein said DNA construct additionally 
comprises an amplifiable gene. 

19. A method of obtaining a gene product from a 
cell line or microorganism, comprising culturing a 

3 0 differentiated cell line or microorganism in accordance 

with claim 15-18 or 24-26 under conditions which permit 
expression of said gene product, and collecting said gene 
product . 

35 
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20. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising a 
DNA regulatory segment capable of modifying the expression 
characteristics of genes in the host cell line or 
microorganism when operatively linked thereto and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 
microorganism . 

21. A method of modifying the expression 
characteris^iics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 
comprising a DNA regulatory segment capable of modifying 
the expression characteristics of said gene when 

15 operatively linked thereto, as compared to its existing DNA 
regulatory segment, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 

wherein said construct is inserted such that said 
regulatory segment is operatively linked to said gene of 

20 interest. 

22. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 

25 comprising an expressible, amplifiable gene capable of 

amplifying said gene when inserted in sufficiently close 
proximity thereto, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 
wherein said construct is inserted such that said 

30 amplifiable gene is in sufficiently close proximity to said 
gene of interest to cause amplification thereof when said 
amplifiable gene is amplified. 

23. A method in accordance with claim 22, 
wherein said DNA construct additionally comprises at least 

35 one expressible selectable marker gene disposed so as to be 
inserted with said expressible, amplifiable gene. 
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24. A cell line or microorganism capable of 
enhanced expression of a gene product compared to the cell 
line or microorganism from which it is derived, said gene 
product beifig the expression product of an endogenous gene 
5 within the genome of said cell, said genome having inserted 
therein in an operative manner, at or near said endogenous 
gene, an exogenous DNA regulatory segment and/ or 
amplifiable gene capable of enhancing the expression of 
said gene product by said cell line or microorganism. 
10 25. A cell line or microorganism in accordance 

with claim 24, wherein said exogenous DNA regulatory 
segment and/or amplifiable gene is an exogenous DNA 
regulatory segment. 

26. A cell line or microorganism in accordance 
15 with claim 24, wherein said exogenous DNA regulatory 

segment and/ or amplifiable gene is an exogenous amplifiable 
gene. 

27. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising 

20 an expressible, amplifiable gene capable of amplifying a 
gene in the host cell line or microorganism when inserted 
in sufficiently close proximity thereto, and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 

2 5 microorganism . 

28. A method in accordance with claims 1, 2, 3, 
4, 5, 21, 22, or 23, wherein said cell line or 
microorganism is a microorganism. 

30 



35 



aNS0OCID:<WO 9109955A1> 



wo 91/09955 



PCr/US90/07642 
» « 



1 / 14 



FIG. I 




SUBSTITUTE SHEET 



wo 91/09955 



PCT/US90/07642 



2/14 

FIG. 2 A 

RANDOM INTEGRATION 



E 


E' W/MM/M. 


D 


D' 


C 


C 


F 


WMmm 



RANDOM INTEGRATION 



' wmw/ 



FIG. 2B 

HOMOLOGOUS RECOMBINATION 





HOMOLOGOUS RECOMBINATION 



TRANSCRIPTION 



SUBSTITUTE SHEET 



BNSOOCID: <WO 9109955A1> 



wo 91/09955 



PCr/US90/07642 



3/14 

F I G . 3 

HOMOLOGOUS RECOMBINATION CONSTRUCT FOR RAT 

TSH BETA 



FIG. 4 




GENE OF INTEREST ( 



HOMOLOGOUS RECOMBINATION 



GENE OF INTEREST / 



B 



C 



B 



SUBSTITUTE SHEET 



wo 91/09955 



PCT/US90/07642 



4/14 

FIG. 5 



Hind3 (5025) 




Aat2(2567) 



ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



9109955A1> 



wo 91/09955 PCT/US90/07642 

* 

5/14 

FIG. 6 




• SITE NO LONGER EXISTS 
ARROW INDICATES SENSE D IRECTION 



SUBSTITUTE SHEET 



wo 91/09955 



PCr/US90/07642 



6/14 

FIG. 7 



Bg I 2 (5229) 




Apa1 (3112) Aat2(2561) 



ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



9109955A1> 



wo 91/09955 



PCT/US90/07642 



7/14 



FIG. 8 



Aat II (5041) 




Hindlll (400) 

SphI (406) 

PstI (4 12) 
Sail (418) 

Nde 1»/Xba1 (424) 



Hind3VXba1(1002) 
Bgl2VBamHr(i006) 



EcoRI(2875) 
Sad (2869) 
Kpnl(2863) 
S ma I (2859) 

BamHI(2854) 



2073) 



PstI (2206) 



•SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUt&SHEET 



8NSOOC1D: <WO 9109955A1> 



wo 91/09955 



PCr/US90/07642 



8/14 



FIG. 9 



Aatll(4252) 




Hindlll{400) 

SphI (406) 

PstI (412) 
Sall(418) 

NdelVXbal (424) 



Smal VXhol (2080) 

Kpnl (2084) 
Sad ( 2090) 

EcoRI (2096) 



Hind 3VX bald 002) 
Bgl2*/BannHr(1006) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



wo 91/09955 



PCT/US90/07642 



9/14 

FIG. I O 



Hind 3 (6660) 




Aat2{4202) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



BNSOOCIDkWO 9109955A1> 



wo 91/09955 



PCr/US90/07642 



10 / 14 



FIG. II 



RAT TSH BETA 

Xbal XbaT Hind3 

I 23 



2255bp 



10600 bp 

I 

4700 bp 

I5300bp 



SUBSTlTUtESHEET 

8NSOOCID:<WO 91099S5A1> 



Xbal 



wo 91/09955 



PCr/US90/07642 



11 / 14 

FIG. 12 



Xbar/Hind3(8916) 




Apa1(4753) Aat2(4202) 



•SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 

BM'^OOCID: <WO 91099S5A1> 



wo 91/09955 



PCT/US90/07642 



12 / 1 4 



FIG. 13 



Apal (17331) 
Hind3(17281) 



poly A (15760) 
BamH1(15647) 

Apa1(153 72) 
Aat2(14821) 



Sal 1(12832) 



Xbal VHind3(19535) 

Smar / Xhol*/San*(5 78) 
Bgl2VBamH1''(l640) 

Sail (2215) 
Bam H 1(2220) 

Hincl3(2681) 




Hind3(7093) 



Hlnd3(11253) 



Apal (8943) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



BNSOOCID: <WO 9109955A1> 



wo 91/09955 PCT/US90/07642 

FIG. 14 ^3/14' 

LOCATION OF PRIMERS FOR PGR A M PLI Fl CATION OF TS H BETA 

5' 

g gcacgcctctgaotgtggaoag gacacttat gagct ctgtggtctttccctctgattt 
a g C ATGAATGCTGTCGTT CT C T TTTCCGT6CTTTTCGCTCTTGCTTGTGGGCA/5GTCT 

5' TSHB5 3' 

A G T ATATGATGTACGTGGACAGG 
C ATCGTTTTGTATTCCCACTG AG T ATATGATGTACGTGGACAGGAGAGQGTGTGCCTAC 

TG C CTG ACCATC A AC AC CACCATCTGC GCTGGG TATTGTATG ACACGG g t at g 1 1 ggt 

c act gcg tt tc 1 1 t t agct g t-a a a t tg tacaggt ctaaagtt gtctgttaatattttag 
a a aggaagtggga t aaa tc ata gtctcctctt tg ggaagccaagcacactgctttcoga 
a t t a taatta tg t CO tt c t acac a g aaaaagta cagotacat t g taacagtttacccta 
aagtgtttgttctgctcaatgg t ag a tgagaagaaagtgtccttttttgtctctgaggg 
g t t aag tgtagat gtgtggg to a c a gagctcaggagtcctt taagatcatcaggaaaca 
aagggatattagtcattctattacac toogttg c atgcagtttatcatgt toagatctc 
ttttcttccacagG ATAT C A A T G GC A AACTGTTTCTTCCCAAGTACGCACTCTCTCAG 

K)t»XXXit)t X X X XXXitX^eXX XXJtXXXXJtXXXXXXXXXXXXXXXX-X 

G ATG TCTGTACATACAGAG ACT TC ACCTACASAACGGTGGAAAIACCGGGffTGCCCACA 

X XXXXXXXXXX^tX *X**^tXXiH( *X »^**«XXXXX)**XX*X)tXX*XXXXXXXXX^XX)tX 

C C ATGTTGCTCCTTATTTC TC C T A C CCCGTTGCC CTGA6CT3CAAGTGTCGCAAGTC1A 

)t*XXX)tXXXX*^t*)tX-XX XX X«XXX X*Xit»3tX)tXXXJtJtXXXXXXXXX*XXXXX*XXXX 

G GACTCGACGTTCACACCGTTCAC 
3' TSHB3 5' 

A CACTG ACTACAGCG ACTG TA C A C A CG AGGCTGTCAAAACCAACTACTGCACCA/SGCCA 
C AG AC ATTCTATCTGGGGG GATTTTCTGGTTAACTGTAATGGCAATGCAATCTG GTTAA 
ATG TGTTTACCTGGAATAGAACTAATAAAATATCATTGAT atgtcttgcGtgccattt 

QQ tccataggcaca t ccacaaggcat tagagogc t tacacaact t tagaagca gaggcg 

3' 
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